Evaluating Multi-focus Natural Language Queries over Data Services
نویسندگان
چکیده
Natural language interfaces to data services will be a key technology to guarantee access to huge data repositories in an effortless way. This involves solving the complex problem of recognizing a relevant service or service composition given an ambiguous, potentially ungrammatical natural language question. As a first step toward this goal, we study methods for identifying the salient terms (or foci) in natural language questions, classifying the latter according to a taxonomy of services and extracting additional relevant information in order to route them to suitable data services. While current approaches deal with single-focus (and therefore single-domain) questions, we investigate multi-focus questions in the aim of supporting conjunctive queries over the data services they refer to. Since such complex queries have seldom been studied in the literature, we have collected an ad-hoc dataset, SeCo-600, containing 600 multi-domain queries annotated with a number of linguistic and pragmatic features. Our experiments with the dataset have allowed us to reach very high accuracy in different phases of query analysis, especially when adopting machine learning methods.
منابع مشابه
Efficient XML - to - SQL Query Translation : Where to Add the Intelligence ? ( Extended
Exporting XML views of relational data gives rise to the problem of translating XML queries into SQL. To date, the focus of most of the work in the published literature [9, 14, 20] has been on mechanisms for correctly translating complex XML queries into SQL queries, with less emphasis on evaluating the quality of the resulting SQL queries. The efficiency of the SQL queries generated by the tra...
متن کاملPMJoin: Optimizing Distributed Multi-way Stream Joins by Stream Partitioning
In emerging data stream applications, data sources are typically distributed. Evaluating multi-join queries over streams from different sources may incur large communication cost. As queries run continuously, the precious bandwidths would be aggressively consumed without careful optimization of operator ordering and placement. In this paper, we focus on the optimization of continuous multi-join...
متن کاملREHABROBO-QUERY: Answering Natural Language Queries about Rehabilitation Robotics Ontology on the Cloud
We introduce a novel method to answer natural language queries about rehabilitation robotics, over the formal ontology REHABROBO-ONTO. For that, (i) we design and develop a novel controlled natural language for rehabilitation robotics, called REHABROBO-CNL; (ii) we introduce translations of queries in REHABROBO-CNL into SPARQL queries, utilizing a novel concept of query description trees and de...
متن کاملAnswering Natural Language Queries about Rehabilitation Robotics Ontology on the Cloud
We introduce a novel method to answer natural language queries about rehabilitation robotics, over the formal ontology REHABROBO-ONTO. For that, (i) we design and develop a novel controlled natural language for rehabilitation robotics, called REHABROBO-CNL; (ii) we introduce translations of queries in REHABROBO-CNL into SPARQL queries, utilizing a novel concept of query description trees and de...
متن کاملEfficient Indexing and Querying over Syntactically Annotated Trees
Natural language text corpora are often available as sets of syntactically parsed trees. A wide range of expressive tree queries are possible over such parsed trees that open a new avenue in searching over natural language text. They not only allow for querying roles and relationships within sentences, but also improve search effectiveness compared to flat keyword queries. One major drawback of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012